Base Pair Openings and Temperature Dependence of DNA Flexibility 
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The relationship of base pair openings to DNA flexibility is examined. Published experimental 
data on the temperature dependence of the persistence length by two different groups are well 
described in terms of an inhomogeneous Kratky-Porot model with soft and hard joints, corresponding 
to open and closed base pairs, and sequence-dependent statistical information about the state of 
each pair provided by a Peyrard-Bishop-Dauxois (PBD) model calculation with no freely adjustable 
parameters. 
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The bending flexibility of DNA is essential in key bi- 
ological processes, such as its packing in chromatin or 
in viruses. While single strands of DNA (or RNA) are 
highly flexible owing to an almost free rotation around 
single chemical bonds, the double helical structure with 
its hydrogen-bonded stacked base pairs favors a very rigid 
configuration. One may ask what are the natural limits 
of this rigidity. This question has been addressed almost 
since the discovery of DNA structure and has led to con- 
troversial results. 

A measure of the flexibility of the molecule is pro- 
vided by its persistence length, which is the correlation 
length for the thermal fluctuations of a vector tangent to 
the helix axis. Early measurements were made by light 
scattering methods The temperature dependence of 
persistence length was measured by Gray and Hearst Q 
in sedimentation experiments. The analysis of these re- 
sults by a model that describes DNA flexible rod, 
the so-called wormlike chain (WLC) model, led the au- 
thors to suggest that a statistically homogeneous WLC 
description did not apply, and to suggest that, instead of 
a smooth bending, the flexibility could come from sharp 
bends, at "flexible joints" . This appeared consistent with 
the observation of "DNA breathing" by von Hippel and 
coworkers A denaturation of the double helix, form- 
ing local single strands, would provide those "joints". 
Another hint of the possibility of sharp bends came from 
cyclization measurements of short chains Q which gave 
a much higher probability to find the two ends of a short 
DNA molecule next to each other than predicted by a 
homogeneous WLC model leading to smooth bendings. 
A theoretical analysis Q showed that a localized single- 
stranded bubble mechanism was a possible explanation, 
compatible with the energetics of the thermal fluctua- 
tions of DNA. However, the interpretation of cyclization 
experiments was criticized in later work Q identifying 
the high ligase protein used in 0] as the source of a pos- 
sible artefact. 

As the local opening of DNA can be thermally induced, 



a study of the temperature dependence of the persistence 
length should provide a clue. In fact recent measurements 
performed between 5 and 60 C reveal a much stronger 
variation of the persistence length than that cither found 
in earlier work |2| or predicted by a WLC model with a 
fixed stiffness constant. The authors of Ref. ruled out 
the possibility that local openings could be responsible 
for this behavior on the basis of the low opening proba- 
bilities derived from the rate of exchange of the protons 
involved in the base-pair hydrogen bonds when DNA is 
in solution in deuterated water [8[ . 

A promising experimental route to resolve the issue of 
whether local openings are relevant to increased DNA 
flexibility would be to examine the strongly fluctuat- 
ing premelting and melting regimes. It turns out that 
this was done many years ago in a pioneering study of 
magnetic birefringence of DNA solution by Maret and 
coworkers In this work we interpret the results of 
Q in terms of base-pair openings whose populations can 
be determined by a lattice-dynamically motivated meso- 
scopic model of DNA. Furthermore, we apply these ideas 
to a direct calculation of the average end-to-end distance 
which allows us to comparcpredicted effective persistence 
length with the results of [7| . Our analysis of the two ex- 
periments in terms of standard model ideas of heteroge- 
nous polymer elasticity and DNA bubbles describes the 
full temperature dependence of the two experimental re- 
sults and suggests that a quantitative link between base 
pair openings and local flexibility can be detected in mea- 
sured properties of DNA solutions. 

We model the elastic behavior of a DNA chain of N 
base pairs in a magnetic field B along the z axis in terms 
of a heterogeneous Kratky-Porod chain [l(| 
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where Sj is the unit vector describing the direction of the 
jth base-pair plane. The first term is simply a discrete 
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FIG. 1. (color online) The quantity Q (left y-axis), as ex- 
tracted from measurements of magnetic birefringence (circles, 
adapted from [jj), and calculated (i) on the basis of Eq. 
(dashed curve), (ii) from the heterogeneous KP model (full 
line); the dotted curve shows a 1/T behavior for comparison. 
Also shown (right y-axis) is the measured melting fraction for 
a similar sample yj| (dotted line) and as calculated from a 
290kb sequence of 60s taurus [3| (thin full line). 



version of the WLC model, with a local stiffness constant 
Jj.j+i ■ The second term describes the contribution of the 
magnetic energy. A^ = X11 — X± is the anisotropy of the 
magnetic susceptibility of a single base pair, known to be 
negative (diamagnetic) in a fixed molecular frame where 
the parallel direction is along the axis of the helix. The 
(Sj) 2 = cos 2 6j factor follows from the Euler (6,<p) ro- 
tation which transforms the susceptibility tensor to the 
laboratory frame if one integrates over all possible az- 
imuthal angles (axial symmetry) . The stiffness constants 
will be taken as cither J (stiff) or J' (soft) with an ap- 
propriate probability distribution (cf below). 

The magnetically induced optical birefringence An = 
nil — n± of the refractive index along directions paral- 
lel and perpendicular to the field, measured in |9|, is 
produced by the difference in the (optical frequency) 
dielectric constant which originates in the (electrically) 
anisotropically polarizable units, the DNA base pairs. 
Standard electromagnetic theory implies 
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where n is the average index of refraction, Aa = an —a±_ 
the anisotropy of the electronic polarizability tensor of a 
single base pair in the molecular frame, p the number of 
monomers (base pairs) per unit volume, and 
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is an averaged orientational factor whose origin can be 
traced, in analogy with the second term in the magnetic 



Hamiltonian ([I]), to the anisotropy of the axially sym- 
metric pola rizability tensor in the laboratory frame. Fol- 
lowing [11[ we consider any local field corrections to be 
incorporated in effective values of the polarizability. The 
thermal averaging is performed over conformations of the 
molecule according to ([1} . Since, even for high magnetic 
fields, Ax-B 2 is much smaller than the typical thermal 
energies fcsT, the averaging of the orientational factor 
can be evaluated to leading order in B 2 
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where 
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and the subscript implies averaging over field-free con- 
figurations of the KP chain ([1]). For a homogeneous KP 
chain of v monomers the value of Q is [l2| 
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where u = 1 - (3/K)(cothK - 1/K), K = J/k B T. For 
the case of DNA, the orientational averages are computed 
in an heterogeneous KP ensemble where the stiffness con- 
stants depend on whether the molecule is locally in a 
double-stranded or open configuration 
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Pq 3+ is the joint probability that base pairs (J) and 
(j + 1) are open. It is obtained from an exact calculation 
of the partition function of the mesoscopic PBD Hamil- 
tonian (T3 |. which associates a transverse coordinate y n 
with each base pair and is known to describe the thermal 
denaturation of long DNA chains with known sequence 
and salt content, with the method described in Ref. 14 1. 



The model parameters, determined in earlier analysis of 
melting profiles [l5| have not been adjusted for this study. 

Fig. Q] compares theoretical predictions as outlined 
above with the experimental data on calf thymus DNA. 
The experimental values of the dimensionless quantity 
Q have been extracted from the measured birefringence 
values [j| using a mean index of refraction n = 1.33 
and the values Ay = -1.55 x 1CT 20 erg/T 2 
Aa = —18.2 A 3 17 1 for the anisotropics, respectively, 
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of the diamagnetic susceptibility and electronic polariz- 
ability per base pair. The parameter J has been set to 
J = 7.02 x 10~ 12 erg which corresponds to a room tem- 
perature persistence length of 59 nm. Consistently with 
the parameters of the PBD model which introduce a 
ratio of 50 between the base pair interactions for stacked 
or unstacked bases, we set J' = J/50 = 0.14 x 10~ 12 erg, 
which gives a persistence length of 1.87 nm for single- 
stranded DNA. For reference purposes, we show on Fig.[T] 
the experimental calf thymus melting profile [l8j and a 
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PBD-modcl computed profile from a 290 kbp long seg- 
ment of bovine chromosome with a similar melting tem- 
perature [l9j]. The computed profile has been obtained 
with the mo del p arameters and the numerical procedure 
described in 
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To complete this study we would like to present an al- 
ternative way of computing the dimensionless quantity Q 
using PBD model input on base pair openings, following 
the analysis used recently in the interpretation of neu- 
tron scattering experiments on B-DNA [20j | . Although 
slightly less accurate than the previous numeric evalua- 
tion, the alternative route illustrates some of the under- 
lying properties of open vs. closed base pairs. According 
to that picture, neutron diffraction - and in the present 
case magnetic birefringence - is produced from oriented 
intact clusters of bound base pairs. An intact cluster of 
length v (defined as enclosed by open base pairs on both 
sides) occurs with a probability P v which satisfies the 
sum rules Q Y^=q p v = ®i Y^T=i vP v = 1-6, where 
is the overall fraction of open base pairs. It follows 
that the average size of intact clusters (measured in base 
pairs) 
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can be determined in terms of 6 and the "zero size clus- 
ter probability" Pq (cf. Fig. [5]); note that the latter 
quantity is in fact the joint probability of two successive 
base pairs being in the open state. Moreover, the P v 's 
behave exponentially, i.e., P v = Ae~ av ,v > 1, where 
A = (1 - 9)/[s(s -1)1 and a = - ln(l - 1/s), to a high 
degree of accuracy 0, [2(| . 

It is therefore possible, in an approximate sense which 
neglects contributions to birefringence from open base 
pairs, to evaluate Q as an average over all intact clusters, 
treated as if they were freely jointed 
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where it should be recognized that for typical ds-DNA 
parameters, K >> 1 is very nearly equal to the persis- 
tence length A expressed in units of the monomer dis- 
tance a = 0.34 nm. The dashed curve in Fig. Q] expresses 
Eq. [7] for the same J = 7.02 x 10~ 12 erg as above. The 
agreement between the theoretical predictions and the 
magnetic birefringence experiments (Fig.[lJ, in the whole 
temperature range from room to melting temperatures, 
validates this approach which implies that fiuctuational 
openings and persistence length of DNA are intimately 
linked, even at temperatures significantly below melting. 

We now turn our attention to the more recent set of 
persistence length measurements which were based on cy- 
clization of A-phage DNA fragments Q . The overall tem- 
perature dependence of the persistence length appears 
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FIG. 2. (color online) Some average properties of the bovine 
sequence [7j|. Left y-scale: the average size (in bps) of in- 
tact clusters (dash-dotted curve); the inverse of the melting 
fraction (dotted curve). Right j/-scale: the melting fraction 
O (solid curve) and the conditional probability Pq/O that if 
a base pair is in the open state, the next one will also be in 
the open state (dashed curve); note that the latter quantity 
is much larger than S (bubble aggregation tendency). 



(Fig. [3]) to be stronger than the 1 /T predicted by the ho- 
mogeneous KP model. In what follows we will show that 
such a strong temperature dependence is entirely consis- 
tent with the bubble-based scenario proposed above for 
the description of the birefringence data. We consider 
a heterogeneous KP model with stiffness constants J or 
J' determined with the probabilities given by the PBD 
model (using Eq. (|5]), J' = 0.14 x 10~ 12 erg and slightly 
varying J). It can be used to compute a theoretical end- 
to-end distance, based on the bubble-based scenario, for 
the 200 bp sequence of the A— phage for a Na + concen- 
tration of 0.004 M (cf. Q), from 
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A subsequent analysis based on the homogeneous WLC 
model as in 0, HH, with the same total length L = Na, 
i.e. with 



(R 2 ) = 2£A-2A 2 (l-e- L/A ) 
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allows us to extract an effective persistence length which 
can be directly compared with the experimental data. 
The resulting effective persistence length, obtained for 
J = 6.13 x 10 -12 erg, plotted in Fig. [31 shows that the 
data of are consistent with a bubble-based analysis. 

We have shown that the results of the two experiments 
which measured DNA persistence length in a tempera- 
ture range sufficiently broad to detect deviations from a 
1 /T behavior can be fitted by adopting a single approach 
combining a discrete version of the WLC model with the 



4 



60 



40 




=^=41 



20- 







0.1 



0.01 
1E-3 



20 



80 



40 60 
T(C) 

FIG. 3. (color online) Persistence length obtained by cycliza- 
tion of DNA fragments (filled squares, open circles, redrawn 
from ref. ). The solid curve shows the estimate obtained 
from the inhomogeneous KP chain with stiff and soft joints 
distributed according to (0 and probabilities from a PBD cal- 
culation. Also shown is a 1/T reference curve (dotted line), 
and the melting fraction (dashed line, right y-axis) obtained 
from the PBD calculation. 

statistical physics of the base-pair fluctuational openings. 
Three comments are in order here. 

First, models which do not include fluctuational open- 
ings lead to a persistence length that varies as 1/T, in 
significant disagreement with experiments. The data of 
Ref. Q show that, in the 5 - 42° C range the observed 
variation is 1.45 times bigger than predicted by an 1/T 
law. 

The second comment concerns compatibility of our re- 
sults with the low densities (< 10 -5 ) of fluctuational 
openings and the associated long lifetimes (~ 10 msec) 
of base pairs measured by imino proton exchange ||. 
The results of Ref. Q show very clearly that the ac- 
tivation enthalpy for the opening, which stays roughly 
constant in the — 25°C range, increases significantly 
above 25° C. In other words there is an acceleration of 
the exponential rate at which lifetimes decrease with in- 
creasing temperature. A straightforward extrapolation 
of data appearing in Fig. 12 of [8|b] suggests an increase 
by a factor of 10 between 25° C and 40° C. Thus, events 
considered as exceptional at room temperature become 
more plausible at a temperature close to physiological 
temperature. Moreover it should be noted that relatively 
few disruptions are sufficient to significantly change the 
persistence length. Perhaps the most striking indirect ev- 
idence of the role of small numbers of fluctuational open- 
ings in the premelting regime of genomic DNA comes 
from viscosity measurements j22j . 

The third comment refers to possible extensions and 
applications of the present work. The approach used 
here, owing to its computational efficiency, can be ex- 
ploited in its present form for a systematic study of se- 



quence effects on local flexibility 23[. On the other hand, 
it is likely that extended soft regions of the DNA molecule 
may require a less minimalist approach than the one pre- 
sented here. In particular, it is possible that a more com- 
plex interplay between the torsional, bending, and local 
melting [24j effects may determine the details of the pre- 
melting regime of cyclization experiments; as more ex- 
perimental data become available, generalizations of the 
present work in that direction may become fruitful. 

As a final remark, it may be noted that the quantita- 
tive link between fluctuational openings and local flexi- 
bility explored in this work provides further evidence that 
the dynamical character of DNA should not be underes- 
timated, even away from its melting transition. 

N. T. acknowledges support by the program Accucil- 
Pro of Region Rhone- Alpcs. 
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